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<110> APPLICANT: Duke University 
Lin, Haifan 

<120> TITLE OF INVENTION: PURIFIED AND ISOLATED piwi FAMILY GENES AND GENE 

PRODUCTS AND THERAPEUTIC AND SCREENING METHODS USING SAME 
<130> FILE REFERENCE: Attorney Docket No. 180-104/2 
<140> CURRENT APPLICATION NUMBER: 09/873, 737A 
<141> CURRENT FILING DATE: 2001-06-04 
<150> PRIOR APPLICATION NUMBER: PCT/US99/28764 
<151> PRIOR FILING DATE: 1999-12-03 
<150> PRIOR APPLICATION NUMBER: 60/110,901 
<151> PRIOR FILING DATE: 1998-12-04 
<160> NUMBER OF SEQ ID NOS : 21 
<170> SOFTWARE: Patentln Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 3047 
<212> TYPE: DNA 
<213> ORGANISM 



Drosophila sp. 



CDS 
(84) 

misc. 
(120 



e 



. (2612) 



;_f ea£ure 




n=a or c, Xaa=Leu or lie 



ure 



n=a or t, Xaa=Leu or lie 



r eature 



or c, Xaa=Leu or lie 



<220> FEATURE: 

<221> NAME/KEY: 
<222> LOCATION: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY: misc_fe 
<222> LOCATION: (399) 
<223> OTHER INFORMATION 
<220> FEATURE: 

<221> NAME/KEY: misc_f eajtfire 
<222> LOCATION: (2436) <^ 
<223> OTHER INFORMATION: n=a 
<400> SEQUENCE: 1 

ctgagtccaa agcgtcgttt tcaaagtact ctttcagttt ccattgtgaa gttttaagtg 60 
atcgcgagtg ccaaaaagta aca atg get gat gat cag gga cgt gga cgc agg 113 

Met Ala Asp Asp Gin Gly Arg Gly Arg Arg 
15 10 
cgt cca/Stt aac gaa gat gat tec tct act tec cga ggt agt ggt gat 161 
Arg ProTtaa Asn Glu Asp Asp Ser Ser Thr Ser Arg Gly Ser Gly Asp 

15 20 25 

ggg ccg egg gtg aaa gta ttc aga gga tct tea tea ggt gac ccg aga 209 
Gly Pro Arg Val Lys Val Phe Arg Gly Ser Ser Ser Gly Asp Pro Arg 

30 35 40 

gcg gat cct cgt ata gag get tea aga gag aga aga get etc gag gaa 257 
Ala Asp Pro Arg lie Glu Ala Ser Arg Glu Arg Arg Ala Leu Glu Glu 

45 50 55 

get ccc agg cgt gaa ggt ggc ccg cca gag cga aag ccg tgg ggt gac 305 



file://C:\CRF3\Outhold\VsrI873737A.htm 



10/23/01 



Page 2 of 7 



353 



RAW SEQUENCE LISTING DATE : 10/23/2001 
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68 Ala Pro Arg Arg Glu Gly Gly Pro Pro Glu Arg Lys Pro Trp Gly Asp 

69 60 65 70 

71 caa tat gat tac ctg aat acc cgt ccg gtt gag ctg gta tec aag aag 

72 Gin Tyr Asp Tyr Leu Asn Thr Arg Pro Val Glu Leu Val Ser Lys Lys 

73 75 80 85 

W -> 75 gga acc gat ggc gtc ccg gtc atg ctg cag acg aac ttt ttt cga "1 
W--> 76 Gly Thr Asp Gly Val Pro Val Met Leu Gin Thr Asn Phe Phe Arg Xaa 

77 95 100 105 

79 aaa acc aag ccg gaa tgg egg ate gtt cat tat cac gtg gag ttt gtg 

80 Lys Thr Lys Pro Glu Trp Arg He Val His Tyr His Val Glu Phe Val 

81 110 115 120 

83 ccg acc ate gag aat cct cgt gtc cgt atg gga gtt ttg tec aat cat 

84 Pro Thr He Glu Asn Pro Arg Val Arg Met Gly Val Leu Ser Asn His 

85 125 130 135 



449 



497 



545 



593 



87 get aac ctt ctg gga tea ggc tat eta ttc gac gga ctg caa ctg ttc 

88 Ala Asn Leu Leu Gly Ser Gly Tyr Leu Phe Asp Gly Leu Gin Leu Phe 

89 140 145 150 

91 acc acc agg aaa ttc gag cag gaa ate acg gtg etc age gga aag teg 

92 Thr Thr Arg Lys Phe Glu Gin Glu He Thr Val Leu Ser Gly Lys Ser 

93 155 160 165 l™ 

95 aaq ctg gac att gaa tac aag ata tec ata aag ttc gtt gga ttc ata 641 

96 Lys Leu Asp He Glu Tyr Lys He Ser He Lys Phe Val Gly Phe He 

97 175 180 i85 

99 teg tgt get gag ccc cgc ttt ttg caa gtc tta aat eta ata ttg cgc 689 

100 Ser Cys Ala Glu Pro Arg Phe Leu Gin Val Leu Asn Leu He Leu Arg 

101 190 195 200 

103 cgc teg atg aag ggc eta aat ttg gaa tta gtt ggc cgt aat etc ttt 737 

104 Arg Ser Met Lys Gly Leu Asn Leu Glu Leu Val Gly Arg Asn Leu Phe 

105 205 210 215 

107 gat ccc cga get aag ate gaa ata agg gag ttc aaa atg gag eta tgg 

108 Asp Pro Arg Ala Lys He Glu He Arg Glu Phe Lys Met Glu Leu Trp 

109 220 225 230 

111 ccg ggc tat gag aca teg att cgt cag cac gaa aaa gat att tta ttg 

112 Pro Gly Tyr Glu Thr Ser He Arg Gin His Glu Lys Asp He Leu Leu 

113 235 240 245 250 

115 gac acc gaa ata act cac aaa gtt atg cgc acc gag acg ate tac gac 

116 Gly Thr Glu He Thr His Lys Val Met Arg Thr Glu Thr He Tyr Asp 

117 255 260 265 

119 ata atg cga cgt tgc tea cac aat ccg get cgt cat cag gac gaa gta 

120 He Met Arg Arg Cys Ser His Asn Pro Ala Arg His Gin Asp Glu Val 

121 270 275 280 

123 egg gta aat gtt ttg gac ttg att gtc ctt acg gat tac aat aac aga 

124 Arg Val Asn Val Leu Asp Leu He Val Leu Thr Asp Tyr Asn Asn Arg 

125 285 290 295 

127 act tat cgt ate aat gat gtc gac ttt gga caa act ccg aaa tea aca 

128 Thr Tyr Arg He Asn Asp Val Asp Phe Gly Gin Thr Pro Lys Ser Thr 

129 300 305 310 

131 ttc agt tgc aag ggt aga gat ate agt ttc gtg gaa tac tat etc act 

132 Phe Ser Cys Lys Gly Arg Asp He Ser Phe Val Glu Tyr Tyr Leu Thr 



785 



833 



881 



929 



977 



1025 



1073 
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133 315 



320 325 330 

135 aaa tat aat ata cgc att cgc gac cac aat cag ccg ctg ctg att tec 1121 

136 Lys Tyr Asn He Arg He Arg Asp His Asn Gin Pro Leu Leu He Ser 

137 335 340 345 

139 aaa aat agg gac aag get eta aaa act aac get age gaa tta gtg gta 1169 

140 Lys Asn Arg Asp Lys Ala Leu Lys Thr Asn Ala Ser Glu Leu Val Val 

141 350 355 360 

143 eta att cct gag etc tgc cga gtg act ggg etc aat gee gag atg cgc 1217 

144 Leu He Pro Glu Leu Cys Arg Val Thr Gly Leu Asn Ala Glu Met Arg 

145 365 370 375 

147 tea aac ttt cag ctt atg cgt gee atg age agt tat acg cga atg aac 1265 

148 Ser Asn Phe Gin Leu Met Arg Ala Met Ser Ser Tyr Thr Arg Met Asn 

149 380 385 390 

151 ccc aaa caa cgc act gat cga ttg cgc get ttt aac cac cgt tta caa 1313 

152 Pro Lys Gin Arg Thr Asp Arg Leu Arg Ala Phe Asn His Arg Leu Gin 

153 395 400 405 410 

155 aac act cca gaa agt gtg aag gtc ttg aga gac tgg aac atg gaa ctg 1361 

156 Asn Thr Pro Glu Ser Val Lys Val Leu Arg Asp Trp Asn Met Glu Leu 

157 415 420 425 

159 gac aag aac gtc aca gaa gta caa ggc egg ata att gga cag cag aac 1409 

160 Asp Lys Asn Val Thr Glu Val Gin Gly Arg He He Gly Gin Gin Asn 

161 430 435 440 

163 ate gtg ttt cat aat gga aag gtt cct get gga gaa aac get gat tgg 1457 

164 He Val Phe His Asn Gly Lys Val Pro Ala Gly Glu Asn Ala Asp Trp 

165 445 450 455 

167 caa agg cac ttc aga gac caa agg atg ctt ace act ccg age gat ggc 

168 Gin Arg His Phe Arg Asp Gin Arg Met Leu Thr Thr Pro Ser Asp Gly 

169 460 465 470 

171 etc gat cgt tgg get gtc ate gcg ccg caa agg aat tec cat gaa etc 1553 

172 Leu Asp Arg Trp Ala Val He Ala Pro Gin Arg Asn Ser His Glu Leu 

173 475 480 485 490 

175 cga act eta ctt gac tct ttg tat aga gca get agt gga atg ggt ctt 1601 

176 Arg Thr Leu Leu Asp Ser Leu Tyr Arg Ala Ala Ser Gly Met Gly Leu 

177 495 500 505 

179 aga att cga age ccc cag gaa ttc ata att tat gat gat cgc act gga 1649 

180 Arg He Arg Ser Pro Gin Glu Phe He He Tyr Asp Asp Arg Thr Gly 

181 510 515 520 

183 act tat gtg aga gca atg gat gat tgt gtg cgc tea gat ccc aaa ctt 1697 

184 Thr Tyr Val Arg Ala Met Asp Asp Cys Val Arg Ser Asp Pro Lys Leu 

185 525 530 535 

187 ata tta tgc etc gta ccc aat gat aac gee gaa aga tac tea tea ate 174 5 

188 He Leu Cys Leu Val Pro Asn Asp Asn Ala Glu Arg Tyr Ser Ser He 

189 540 545 550 

191 aaa aag aga gga tac gtt gac agg gcg gtg cca act caa gtt gtg ace 1793 

192 Lys Lys Arg Gly Tyr Val Asp Arg Ala Val Pro Thr Gin Val Val Thr 

193 555 560 565 570 

195 ctt aaa acg ace aag aac cgt age ctt atg age att gee acc aaa ata 1841 

196 Leu Lys Thr Thr Lys Asn Arg Ser Leu Met Ser He Ala Thr Lys He 

197 575 580 585 



1505 
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199 
200 
201 
203 
204 
205 
207 
208 
209 
211 
212 
213 
215 
216 
217 
219 
220 
221 
223 
224 
225 
227 
228 
229 
231 
232 
233 
235 
236 
237 
239 
240 
241 
243 
244 
245 
W--> 247 
W--> 248 
249 
251 
252 
253 
255 
256 
257 
259 
260 
261 
263 



gca ate caa ctg aat tgc aag ttg gga 
Ala He Gin Leu Asn Cys Lys Leu Gly 
590 595 
eta ccc ttg tec gga ctg atg aca att 
Leu Pro Leu Ser Gly Leu Met Thr He 

605 610 
aca cga gat egg aag agg gee tac gga 
Thr Arg Asp Arg Lys Arg Ala Tyr Gly 

620 625 
eta cag caa aac tec acg tac ttc age 
Leu Gin Gin Asn Ser Thr Tyr Phe Ser 
635 640 
ttt gat gtg etc get aac ace ctt tgg 
Phe Asp Val Leu Ala Asn Thr Leu Trp 
655 

cgc caa tat caa cat gag cat agg aag 
Arg Gin Tyr Gin His Glu~ His Arg Lys 
670 675 
tat cga gac ggt gtg age tec ggc tct 
Tyr Arg Asp Gly Val Ser Ser Gly Ser 



tat aca ccc 
Tyr Thr Pro 

ggc ttt gac 
Gly Phe Asp 

gca ttg att 
Ala Leu He 
630 

aca gtc acg 
Thr Val Thr 
645 

ccg atg ata 
Pro Met He 
660 

ctg cca tct 
Leu Pro Ser 

eta aag cag 
Leu Lys Gin 



685 



690 



gaa gtc aag gac ate att gag aag ttg aaa act gaa 
Glu Val Lys Asp He He Glu Lys Leu Lys Thr Glu 
700 705 710 

cag eta age cca ccg caa tta get tat att gtg gta 
Gin Leu Ser Pro Pro Gin Leu Ala Tyr He Val Val 
715 720 725 

aac acg cgc ttc ttc etc aac gga caa aat cct ccg 
Asn Thr Arg Phe Phe Leu Asn Gly Gin Asn Pro Pro 

735 740 
gtt gat gac gtt ata act ctg ccc gag aga tac gac 
Val Asp Asp Val He Thr Leu Pro Glu Arg Tyr Asp 

750 755 
teg caa caa gtt cgt cag ggt aca gtg teg ccg ace 
Ser Gin Gin Val Arg Gin Gly Thr Val Ser Pro Thr 

765 /TV 70 
ctt tat age age atg ggt/ntc)tca ccg gag aaa atg 
Leu Tyr Ser Ser Met Gly^-Xaa Ser Pro Glu Lys Met 

780 785 790 

tac aag atg tgc cac ttg tac tac aat tgg teg ggc 
Tyr Lys Met Cys His Leu Tyr Tyr Asn Trp Ser Gly 
795 800 805 

cca gca gtt tgc cag tac get aag aag eta get ace 
Pro Ala Val Cys Gin Tyr Ala Lys Lys Leu Ala Thr 

815 820 
aac ttg cac tct att ccg caa aac gcg etc gaa aag 
Asn Leu His Ser He Pro Gin Asn Ala Leu Glu Lys 

830 835 
eta taattggata taatttagaa tggagtatta atccttacta 



tgg atg ate 
Trp Met He 
600 

att gcg aag 
He Ala Lys 
615 

gee tea atg 
Ala Ser Met 

gag tgc age 
Glu Cys Ser 

gca aag gee 
Ala Lys Ala 
665 

cga ate gta 
Arg He Val 
680 

ctt ttt gaa 
Leu Phe Glu 
695 

tac gee cgc 
Tyr Ala Arg 

ace aga tec 
Thr Arg Ser 

cct ggt act 
Pro Gly Thr 
745 

ttt tat ctg 
Phe Tyr Leu 
760 

age tac aat 
Ser Tyr Asn 
775 

caa aaa ctt 
Gin Lys Leu 

acc aca cga 
Thr Thr Arg 

etc gtg ggt 
Leu Val Gly 
825 

aag ttt tat 
Lys Phe Tyr 

840 
agaggecata 



gaa 
Glu 

age 
Ser 

gat 
Asp 

gec 
Ala 
650 
ctg 
Leu 

ttt 
Phe 

ttt 
Phe 

gtc 
Val 

atg 
Met 
730 
ata 
He 

gtc 
Val 

gtt 
Val 

acg 
Thr 

gtg 
Val 
810 
acg 
Thr 

tat 
Tyr 



1889 



1937 



1985 



2033 



2081 



2129 



2177 



2225 



2273 



2321 



2369 



2417 



2465 



2513 



2561 



2609 



2662 
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264 Leu _ 
266 tatgaaacta gcccagacat ttatactttt tcaatacttc cttacttttg ctaagcactt 2722 
268 cagcatttat gactaaatat tttgtatttg aaatgcatta ctgctctttt ttcaaacaaa 2782 
270 agcaaaattg aggattaaga ttctggtatt taagcataag accagaggaa attcccaaac 2842 
272 aaacatttaa agttatctat caagacatgt tcattaattt ggaatataat tactttattt 2902 
274 tttattgtat attttagttt atgtaaagaa aaattacata catccatgtt tgcttactta 2962 
276 accacacatt catggctgct tatattcgtg aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3022 
278 aaaaaaaaaa aaaaaaaaaa aaaaa 3047 

281 <210> SEQ ID NO: 2 

282 <211> LENGTH: 843 

283 <212> TYPE: PRT 

284 <213> ORGANISM: Drosophila sp . 
2 86 <220> FEATURE: 

2 87 <2 21> NAME/KEY: misc_f eature 

288 <222> LOCATION: (13) 

289 <223> OTHER INFORMATION: Xaa=Leu or lie 

291 <220> FEATURE: 

292 <221> NAME/KEY: misc_f eature 

293 <222> LOCATION: (106) 

294 <223> OTHER INFORMATION: Xaa=Leu or He 

296 <220> FEATURE: 

297 <221> NAME/KEY: misc_f eature 

298 <222> LOCATION: (785) 

299 <223> OTHER INFORMATION: Xaa=Leu or He 

301 <400> SEQUENCE: 2 S~?\ 
W--> 302 Met Ala Asp Asp Gin Gly Arg Gly Arg Arg Arg Pro *aa>sn Glu Asp 

S 10 15 



Asp Ser Ser Thr Ser Arg Gly Ser Gly Asp Gly Pro Arg Val Lys Val 
20 25 30 



303 
305 
306 

308 Phe Arg Gly Ser Ser Ser Gly Asp Pro Arg Ala Asp Pro Arg He Glu 

309 35 40 45 

311 Ala Ser Arg Glu Arg Arg Ala Leu Glu Glu Ala Pro Arg Arg Glu Gly 

312 50 55 60 

314 Gly Pro Pro Glu Arg Lys Pro Trp Gly Asp Gin Tyr Asp Tyr Leu Asn 

315 65 70 75 80 

317 Thr Arg Pro Val Glu Leu Val Ser Lys Lys Gly Thr Asp Gly Val Pro 

318 85 <«L 95 
W--> 320 Val Met Leu Gin Thr Asn Phe Phe Arg/Xaa )Lys Thr Lys Pro Glu Trp 

321 100 105^ y 110 

323 Arg He Val His Tyr His Val Glu Phe Val Pro Thr lie Glu Asn Pro 

324 115 120 125 

326 Arg Val Arg Met Gly Val Leu Ser Asn His Ala Asn Leu Leu Gly Ser 

327 130 135 140 

329 Gly Tyr Leu Phe Asp Gly Leu Gin Leu Phe Thr Thr Arg Lys Phe Glu 

330 145 . 150 155 160 

332 Gin Glu He Thr Val Leu Ser Gly Lys Ser Lys Leu Asp He Glu Tyr 

333 165 170 175 

335 Lys He Ser He Lys Phe Val Gly Phe He Ser Cys Ala Glu Pro Arg 



336 



180 185 190 
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L-29 M-283 W: Missing Blank Line separator, <220> field identifier 
L-55 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
L-56 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
L:75 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
l!76 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 

L-.247 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 

L-248 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 

L-302 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 2 

L-320 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 2 

L-449 M-341 W: (46) "n" or "Xaa" used, for SEQ ID# : 2 

L-466 M:283 W: Missing Blank Line separator, <220> field identifier 

L-517 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:518 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L-549 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L-550 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

l!593 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L-594 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L-701 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L-702 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L-800 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 

L:824 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 

L-854 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L-935 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:952 M:283 W: Missing Blank Line separator, <220> field identifier 

L-996 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:997 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:1052 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L-1053 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

Lill60 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

Llll61 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:1254 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 
L-1296 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 
L-1377 M-341 W: (46) "n" or "Xaa" used, for SEQ ID# : 6 
L-1409 M-283 W: Missing Blank Line separator, <220> field identifier 
L-1421 M:283 W: Missing Blank Line separator, <220> field identifier 
L-1433 M-283 W: Missing Blank Line separator, <220> field identifier 
L-1445 M-283 W: Missing Blank Line separator, <220> field identifier 
L-1457 M-283 W: Missing Blank Line separator, <220> field identifier 
L : 1469 M-283 W: Missing Blank Line separator, <220> field identifier 
l'-1481 M:283 W: Missing Blank Line separator, <220> field identifier 
L-1493 M:283 W: Missing Blank Line separator, <220> field identifier 
L : 1505 M-283 W: Missing Blank Line separator, <220> field identifier 
L : 1517 M:283 W: Missing Blank Line separator, <220> field identifier 
L-1529 M-283 W: Missing Blank Line separator, <220> field identifier 
L-1541 M:283 W: Missing Blank Line separator, <220> field identifier 
L-1553 M:283 W: Missing Blank Line separator, <220> field identifier 
L-1565 M-283 W: Missing Blank Line separator, <220> field identifier 
L-1576 M:283 W: Missing Blank Line separator, <220> field identifier 
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